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Abstract 

Tomograms, a generalization of the Radon transform to arbitrary 
pairs of non-commuting operators, are positive bilinear transforms 
with a rigorous probabilistic interpretation which provide a full char- 
acterization of the signal and are robust in the presence of noise. To- 
mograms based on the time-frequency operator pair, were used in the 
past for component separation and denoising. Here we show how, by 
the construction of an operator pair adapted to the signal, meaningful 
information with good time resolution is extracted even in very noisy 
situations. 
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1 Introduction 



1.1 Integral transforms: linear, bilinear and tomograms 

Integral transforms [T] [2] are very useful for signal processing in communi- 
cations, engineering, medicine, physics, etc. Linear and bilinear transforms 
have been used. Among the linear transforms, Fourier [3] and wavelets [I] 
[5] [B] are the most popular. Among the bilinear ones, the Wigner-Ville qua- 
sidistribution [7] [S] is the most commonly used to provide information in the 
joint time-frequency domain. A joint time-frequency description of signals is 
indeed important, because in many applications (biomedical, seismic, radar, 
etc.) the signals are of finite (sometimes very short) duration. However, the 
oscillating cross-terms in the Wigner-Ville and other quasidistributions jH] 
[TU] [TTj make the interpretation of these transforms a difficult matter. Even 
when the average of the cross-terms is small, their amplitude may be greater 
than the signal in time-frequency regions that carry no physical information. 

The difficulties with the physical interpretation of quasidistributions arise 
from the fact that time and frequency correspond to two noncommutative 
operators. Hence a joint probability density can never be defined. Even 
in the case of positive quasiprobabilities like the Husimi-Kano function [12] 
[13] . an interpretation as a joint probability distribution is also not possible 
because the two arguments of the function are not simultaneously measurable 
random variables. 

Recently, a new type of strictly positive bilinear transform has been pro- 
posed [2] [IB] [TB], called tomogram, which is a generalization of the Radon 
transform [T7] to arbitrary noncommutative pairs of operators. The Radon- 
Wigner transform [18] [19] is a particular case of such noncommutative to- 
mography technique. The tomograms are strictly positive probability densi- 
ties, provide a full characterization of the signal and are robust in the presence 
of noise. 

A unified framework to characterize linear transforms, quasidistributions 
and tomograms was developed in Ref.[15j. To fix notation we briefly review 
it here. Signals f(t) are considered as vectors |/) in a subspace A^ of a 
Hilbert space "H with dual space A/"*. Then a family of unitary operators 
U (a) = e lB( * a \ a being a label {a G I, I C W 1 }, is defined on J\f*. Using a 
ket-bra notation we denote |/) G Af and (/| G Af*. In this setting three types 
of integral transforms are constructed. Let h G A/"* be a reference vector and 
let U be such that the linear span of {U(a)h G Af* : a G /} is dense in Af*. 
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In {U(a)h}, a complete set of vectors can be chosen to serve as basis. 

1 - Linear transforms 

wf l \a) = (U{a)h\f) (1) 

2 - Quasi-distributions 

Q f (a) = (U(a)f\f) (2) 

3 - Tomograms 

Given an unitary U (a) = e tB( ~ a \ B (a) has the spectral projection B (a) = 
J XP (X) dX. Let 

P(X) = \X)(X\ 

be the projectoiQ on the (generalized) eigenvector (X\ G ftf* of B (a). The 
tomogram is 

Mf\x) = (f\P(X) \f) = (f\ X) (X \f) = \(X \f}\ 2 (3) 

The tomogram M^ B \x) is the squared amplitude of the projection of the 
signal \f) G M on the eigenvector (X\ G A^* of the operator B (a). Therefore 
it is positive. For normalized | /), 

(/ I /} = 1 

the tomogram is normalized 

J M { f B) (X) dX = 1 (4) 

and may be interpreted as a probability distribution on the set of generalized 
eigenvalues of B (a), that is, as the probability distribution for the random 
variable X corresponding to the observable defined by the operator B (a). 
The tomogram is a homogeneous function 

Mf /P \X) = \p\Mf\pX) (5) 



1 Another convenient notation for the projector on a generalized eigenvector of B (a) 
with eigenvalue X is 

5(B(a) -X) = P{X) 
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Examples: 

If U (a) is unitary generated by B F ( c^") = a\t + %ot 2 jg and h is a (gen- 
eralized) eigenvector of the time-translation operator the linear transform 
(a) is the Fourier transform. For the same B F (~ce), the quasi-distribution 
Qf(cx) is the ambiguity function and the Wigner-Ville transform [7] [8] is the 
quasi-distribution Qf(a) for the following B— operator 

B<W{a u a 2 ) = -VIolA - 2a 2 t + - ^ f J- (6) 

The wavelet transform is Wi (a) for B w (73*) = ct\D + %ol 2 ^ v D being 
the dilation operator D = — | (it 4 + i|a The wavelets h ST {t) are kernel 
functions generated from a basic wavelet /i(r) by means of a translation and 
a rescaling (— oo < r < oo, s > 0): 

using the operator 

U^ a \t,s) = exp(iTu) exp(i log s.D), (8) 

h 8iT (t) = U^(r,s)h(t). (9) 

The Bertrand transform [20] [21] is the quasi-distribution Qf(cn) for 
Linear, bilinear and tomogram transforms are related to one another (see 

USD- 

1.2 Tomograms: Some examples 

As shown above, tomograms are obtained from projections on the eigenstates 
of the B operators. These operators may be linear combinations of different 
(commuting or noncommuting) operators, 

B = fiO l + v0 2 

meaning that the tomogram explores the signal along lines in the plane 
(Oi,0 2 ). For example for 

n / \ ■ & 

B (/i, v) = fit + Vbj = fit + iv-r 
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the tomogram is the expectation value of a projection operator with support 
on a line in the time-frequency plane 



X = fit + uu (10) 

Therefore, M { / ] (X, is the marginal distribution of the variable X along 
this line in the time-frequency plane. The line is rotated and rescaled when 
one changes the parameters fi and v. In this way, the whole time-frequency 
plane is sampled and the tomographic transform contains all the information 
on the signal. Instead of marginals collected along straight lines on the time- 
frequency plane, one may use other curves to sample this space [IB] . 

Tomograms associated to the generators of the conformal group: 

Time-frequency 

B 1= ^t + iu^- (11) 



Time-scale 
Frequency-scale 

Time- conformal 



dt 



D - = I' 1 '- n/ [ t J t + ^) {U) 



B3= ^ + n^> (13) 



D\ nt -f /V ( t2 ^ + t ) < 1 J) 



The construction of the tomograms reduces to the calculation of the gen- 
eralized eigenvectors of each one of the Bi operators 
Bi^i (//, is, t, X) = Xif)! (/i, u, t, X) 

■ ( tx\ 

ipi {fJ', v, t, X) = exp i I — — 1 (15) 

with normalization 

dtipl (/i, u, t, X) V>i (/i, v, t, X') = 27tu5 {X - X') (16) 



B 2 ip 2 (//, v, t, X) = Xip 2 (a*, v, t, X) 



il)2(fJ.,v,t } X) = —^=expi ( — - — log |t| ) (17) 

\t\ V v v 
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J dti)* 2 fa, v, t, X) i) 2 (fx, v, t, X') = Attu5 (X - X') (18) 
B 3 *jj 3 (/i, v, u, X) = X^ 3 (a*, v, w, X) 



' fjL X 

^ 3 (fx, v, t, X) = exp (-i) ( -u - — log \u\ 



, , >. 1 . / X u . 

^ 4 (Li,v,t,X) = j-expi — + - log * 

y (A*, t, s) ^4 (/i, Z/, t, s') = 27TZ/5 (s — s') 



Then the tomograms are: 
Time-frequency tomogram 

M 1 {li, v, X) 



1 



27T 1/ 



exp 



ifit 2 itX 
~2v V 



f(t)dt 



Time-scale tomogram 
M 2 (n,v,X) 
Frequency-scale tomogram 

i 

M z (n,v,X) = 



2tt\u\ 



fit) Jj/Mt.!! 

P L V v v & 



(it -H=e 
t 



27T M 



/H r-i(^-^iogM)] 



/(w) being the Fourier transform of /(£) 
Time-conformal tomogram 



M 4 (n,v,X) 



2ir\v\ 



dt 



M e m+^\t\)\ 



(19) 



J dwipl (A*, ^ w, -X") ^i (A*, w, X') = 2ttz/(5 (X - X') (20) 

£4^4 (/i, V, t, X) = X^4 (A*, V, X ) 



(21) 
(22) 



(23) 



(24) 



(25) 



(26) 



The tomograms Mi, M 2 and M 4 interpolate between the (squared) time signal 
(v = 0) and its projection on the (fi, v, t, X) functions for fi — 0. 
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In a similar way, tomograms may be constructed for any operator of the 
general type 

„ . ( , s d Ida (t) 

the generalized eigenvectors being 

i[) g (n,v,t,X) = \g(t)\~ 1/2 expi (~ J 



ds a 
+ - 



sds 



g(s) uj g(s) 

When dealing with finite-time signals and finite-time tomograms some 
normalization modifications are needed. For example, for a time-frequency 
tomogram, instead of f l23|) . we consider the finite-time tomogram, for a signal 
defined from t to to + T 



M x {d,X) 



t +T 2 

r(t)4 l Ut)dt 



to 



\<f,4 1] > 



with 



4% (t) 



exp 



1 cos 9 

2 sin^ 



t 2 



iX 
sin# 



(27) 



(28) 



and fi = cos6, v = sin 9. 9 is a parameter that interpolates between the time 
and the frequency operators, running from to tt/2 whereas X is allowed to 
be any real number. An orthonormalized set of 4ex W vec tors is obtained 
by choosing the sequence 



2rm 

X n = X + — sin9 



neZ 



(29) 



Likewise for the finite-time time-scale tomogram M 2 (/i, z/, X) (Eq EMI) and 
the finite-time time-conformal tomogram M 4 (/i, u, X) (Eq j26|) : 



M 2 (9,X) 



to+T 2 

r{t)^ x {t)dt 



to 



4% (t) 



i 



i 



^oglto + T] — log |t | 
2nn 



exp i 



cos 9 
sin 9 



<f,4 2) >\ 
x 



t 



log [t 



X n — Xq + 



log |t + T| — log |t | 



sin# 



sin 9 



(30) 

(31) 
(32) 
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and 



M A {B,X) 



t +T 

(4) 

J \ V JV, 

to 



2 



</,^ (4) >r (33) 



(4) U \_ Jt (t + T) 1 JcosO ,_ U| , X 



= V^V^w expi l^ log|t| + ^J (34) 

X n = X + - ^° + T ^ 27m sin 9 n G Z (35) 



2 Applications of tomograms: Denoising, and 
component separation 

Most natural and man-made signals are nonstationary and have a multicom- 
ponent structure. Therefore separation of its components is an issue of great 
technological relevance. However, the concept of signal component is not 
uniquely defined. The notion of component depends as much on the observer 
as on the observed object. When we speak about a component of a signal 
we are in fact referring to a particular feature of the signal that we want to 
emphasize. For signals that have distinct features both in time and in the 
frequency domain, the time-frequency tomogram is an appropriate tool. 

Consider finite-time tomograms as in (127)) . For all different #'s the U(8), 
of which B (8) is the self-adjoint generator, are unitarily equivalent operators, 
hence all the tomograms share the same information. 

First we would select a subset X n in such a way that the corresponding 

family j^ 1 ]^ (^) j is orthogonal and normalized, 

< <iXL >= <W (36) 

This is the sequence listed in (129 p . where X Q is freely chosen (in general we 
take X = 0). We then consider the projections of the signal f(t) on this set 
of ipf^ vec tors 

4„ (/)=</, ^1 > (37) 
Denoising consists in eliminating the c e Xn (/) such that 

|4„(/)f<^ (38) 
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for some threshold e. This power selective denoising is more robust than, 
for example, frequency filtering which may also eliminate important signal 
information. 

The component separation technique is based on the search for an inter- 
mediate value of 9 where a good compromise might be found between time 
localization and frequency information. This is achieved by selecting subsets 
J~k of the X n and reconstructing partial signals (fc-components) by restricting 
the sum to 

/*(*) = E c xSf)^e,x n {t) (39) 

for each k. 

3 Signal detection with an adapted operator 
pair 

Time-frequency tomograms have been used for denoising and component 
separation of finite-time signals [2] [22] [23] [21] [25]- Time-frequency to- 
mograms are particularly appropriate to identify the time unfolding of the 
frequency features of the signals. For example, the component separation 
success [22J [23J in the plasma reflectometry applications is to a large extent 
due to the fact that the plasma is sampled by microwave chirps and the basis 
in ( 128]) is exactly a chirp basis. This suggests that, for other types of signals, 
other types of tomograms should be chosen. 

In particular, if in the linear combination B (/x, v) = fit+uO, one chooses 
an operator O, that is specially tuned to the features of the signal that one 
wants to extract, then, by looking for the particular values of the set (/i, v) 
where the noise effects might cancel out, we may separate the information 
of very small signals from large noise and also obtain reliable information on 
the temporal structure of the signal. This would provide a signal-adapted 
filtering technique. The construction of the operator suited to particular 
signals may be done by the same techniques that are used in the bi-orthogonal 
decomposition [26J. 

The method for the construction of the adapted operator pair is as follows: 
Consider a set of iV— dimensional time sequences {xt, ■ • • , xt}, typical of 
the signal one wants to detect. For a communication point of view these 
may be considered as the code words that, later on, one wishes to detect in 
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a noisy signal. Form the k x N matrix U G M k 



xN- 



U 



( ari(lAt) xi(2Ai) ... x^NAt) 
\ x k (lAt) x k (2At) ... x k (NAt) 



(40) 



with k < N typically. 

Now construct the square matrices A = U T U E Mnxn and B = UU T E 
Aikxk- The diagonalization of A provides k non-zero eigenvalues (a±, • • • , a k ) 
and its corresponding orthogonal N— dimensional eigenvectors ($1, ■ • • , 



G 



Correspondingly, the diagonalization of B would provide the same 



k eigenvalues and eigenvectors • • • ,^/ k ) with ^ el If needed one may 
obtain, by the Gram-Schmidt method, the remaining N — k eigenvectors to 
span WL N , which in this context are associated to the eigenvalue zero. 
The linear operator S constructed from the set of typical signals is 



(41) 



where S G A^atxA^- 

For the tomogram we consider an operator B (/x, is) of the form 



/ lAt 



B (/i, v) = jit+uS = \i 



\ 



2At 



3At 



\ 



NAt 



+vJ2 a &®t ( 42 ) 



i=i 



where B G MnxN- 

The eigenvectors of each B (fi,u) are the columns of the matrix that 
diagonalizes it. From the projections of a signal on these eigenvectors one 
constructs a tomogram adapted to the operator pair (t, S). 



3.1 Examples 

Here we present some examples of the application of the technique described 
in the previous Section. 

In the first example a set of 40 random signals with pulses of duration 
At = 10 and intensities +1 or —1 are generated. The total length of the 
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signal is 200 time units. These random signals form the typical data to 
which we will adapt the tomogram, by constructing the operator S in f!4ip . 
Fig.l shows 10 signals of this type. They all vary between +1 and —1, being 
shifted in the figure for clarity purposes. 
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Figure 1: A set of typical signals. 

Once the operator 5* is constructed, one considers the operator 

B(6) =tcos6 + Ssm6 

for which one computes the eigenbasis which is used to project the signals 
to be analyzed. To a pure signal of the same type as those used to construct 
the operator 5* (in the upper left panel of Fig.2), we add Gaussian noise 
(upper right panel of Fig.2). This signal is then analyzed and a tomogram 
constructed for 20 different values of 9 at intervals AO = 7r/40. A contour 
plot of the tomogram is shown in the lower left panel of Fig.2. As we have 
explained before, by inspection of the power distribution in the tomogram 
one may either select the components of intensity higher than a threshold for 
denoising of the whole signal or select particular components of signal. In 
this case what is of interest is to select the part of the signal that corresponds 
to the typical signals used to construct S. This is done by selecting only the 
strongest components at the region where they concentrate. In the lower 
right panel of Fig.2 we show the result of projecting on the eigenvectors 185 
to 200 at 6 19 = 19tt/40. 
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Figure 2: Signal, noisy signal, the tomogram and the projection on the eigen- 
vectors 185 to 200 at 9 = 19tt/40 

One sees that the signal is reasonably reconstructed from the noisy input. 
With a clipping operation at ±0.5 the reconstruction would be perfect. 

For the second example we have generated, as typical signals a set of 40 
sines with random frequencies. Fig. 3 displays some examples. 




20 40 60 80 100 120 140 160 180 200 

t 



Figure 3: A set of sines with random frequencies 
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This case is harder because noise (or interference) is expected to contain 
frequencies similar to the typical signals. Fig. 4 shows the results of a typical 
analysis of a signal that contains pieces of sines at different time intervals. 
In the upper left panel it is the signal, in the upper right panel the signal 
with Gaussian noise added, in the lower left panel the tomogram and in the 
lower right panel the result of the projection on the eigenvectors 189 to 197 
at 6 4 = 15tt/40. 



50 100 150 200 




50 100 150 200 




Figure 4: Signal, noisy signal, the tomogram and the projection on the eigen- 
vectors 189 to 197 at 9 = 19vr/40 

The projection range that was used aimed at including all the strongest 
components. Notice however that by selecting particular regions of the to- 
mogram one may extract particular components of the signal. For exam- 
ple, Fig.5 shows the result of projection on the eigenvectors 190 to 199 at 
9 = tt/10. 

For the third example we use data obtained from the Phoenix Mars Lan- 
der [27] . A dust devil is a hot whirlwind generated by a huge contrast between 
the martian atmospheric air and the planet surface. Dust devils appear in 
both temperature and pressure data as sudden drops with a duration be- 
tween two and three minutes. The upper left panel in Figure 6 shows some 
data from the Phoenix Mars Lander covering a 2000 seconds interval with a 
sampling rate of .5Hz. A dust devil is clearly visible at t ~ 800s as a drop 
in the pressure value. 
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-1.5 1 1 1 1 1 1 1 1 ' 1 

20 40 60 80 100 120 140 160 180 200 

t 

Figure 5: The projection on the eigenvectors 190 to 199 at 9 = n/10 which 
separates the second time component of the signal 

There has been several efforts to develop systematic methods to detect 
the effect of dust devils on the martian atmosphere data. They are based 
either on checking several ad-hoc conditions in the data [27] or on Field- 
Programmable Gate Arrays (FPGAs) [28] . 

To use our adapted tomographic filtering method for the detection of dust 
devils we have generated a set of 278 signals that resemble the shape that 
a dust devil produces on the data, that is a sudden drop of about 3% from 
the baseline, with different durations ranging from 60 to 80 time units. The 
upper right panel displays several of these typical signals. Some of the signals 
have been shifted up or down for representation purposes. 

As in the previous examples a tomogram is constructed for 20 different 
values of 9 at intervals A9 = it /AO. A contour plot of the first 999 coeffi- 
cients of the tomogram is shown in the lower left panel of Figure 6. Coeffi- 
cient n = 1000 corresponds to the biggest eigenvalue (and its corresponding 
eigenvector). This eigenvector contains most of the energy of the signal, and 
is several orders of magnitude bigger that any other coefficient, so clarity 
this coefficient has not been plotted in the tomogram. By direct inspection, 
we observe that, besides the coefficient n = 1000, the strongest components 
concentrate close to n — 400. The lower right panel in figure 6 shows the 
projection on the eigenvectors 340 to 450 and 1000 at 9 = 197r/40. One sees 
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that the pressure drop produced by the dust devil is very well reconstructed 
and separated from any other components present in the signal such as noise 
or smaller presure variations. 




Figure 6: Signal, typical signals, the tomogram (coefs 1-999) and the projec- 
tion on the eigenvectors 340 to 450 and 1000 at 9 = 197r/40 

As an alternative, that avoids the large value of the biggest eigenvalue, 
we may shift both the typical signals and the real data to zero mean sig- 
nals. In this case there is no eigenvalue much larger than all others. The left 
panel of figure 7 displays a 3D plot of the tomogram for the 1000 coefficients 
obtained with zero mean signals. We have also applied a denoising proce- 
dure, removing the small coefficients. The right panel in figure 7 shows the 
projection on the eigenvectors 340 to 450 at 9 = 197r/40. One sees that the 
pressure drop produced by the dust devil is completely separated from any 
other components in the signal. 
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Figure 7: Tomogram for mean typical signals and the projection on the 
eigenvectors 340 to 450 at 9 = 197r/40 

4 Conclusions 

Tomograms provide a two-variable characterization of signals which, due to 
its rigorous probabilistic interpretation, is robust and free of artifacts and 
ambiguities. For each particular signal that one wants to analyze, the choice 
of the appropriate tomogram depends not only on the signal but also on the 
features that we might want to identity or emphasize. 

We have developed a new family of data-driven tomograms that are com- 
bination of time and an operator obtained from a set of typical signals spe- 
cially tuned to represent the features that one wants to extract. These 
adapted tomograms provide, for noisy signals, filtering and separation of 
components and features that might not be well represented by the combi- 
nation of standard operators. 
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